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O© Antitumor protein gene of streptococcus pyogenes su, plasmids containing the gene, transformant 
£2 cells harboring the plasmids, and process for producing the antitumor protein. 

which are complementary to SAGP gene were syn- 
thesized. 



© Amino acid sequence of N-terminal region of 
yj streptococcal acid glycoprotein (SAGP), and anti- 
tumor glycoprotein produced by Streptococcus 
pyogenes Su, was determined, and DNA probes 
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Chromosomal DNA fragment of S.pvoQenes Su, 
which hybridized with those probes was cloned into 
E.coli , and a restriction map of plasmid DNA harbor- 
ing SAGP gene was revealed and upon it, DNA 
sequence or amino acid sequence of SAGP gene 
was determined. 
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AK.T.THMOR PROTEIN GENE OR STREPTOCOCCUS PYOGENES SU, PLASMIDS CONTAINING THE GENE, 
TRANS^ THE PLASMIDS AND PROCESS FOR PRODUCING THE ANTITUMOR 



PROTEIN 



RACKGROUND OF THE INVENTION 

The present invention relates to a gene of 
antitumor protein produced by Streptococcus 
pyogenes , a vector containing the gene and mi- 
croorganisms or cells transformed with the vector, 
and a process for producing the antitumor protein. 

S.ovoaenes is a Gram-positive hemolytical 
coccus which is known as a pathogen of eryspelas, 
lochiopyra, and hematospesis. It is also known that 
strains of S.ovoaenes have antitumor activities and 
the sterilized cells are now clinically used as an 
anticancer agent 

It is reported that the substance isolated from 
the ceils of S.pvoaenes Su by monitoring in vitro 
cell growth inhibition activity has been proved to 
have in vivo antitumor activity by experiments us- 
ing animals [Yoshimura, Japanese Laid-Open Pat- 
ent Publication (Kokai) No, 58-222026]. 

Yoshimura reports that this antitumor protein is 
a unique glycoprotein with a molecular weight of 
about 50,000 (analyzed by SDS-polyacrylamide gel 
electrophoresis), but its amino acid sequence has 
not yet been revealed, nor has a gene coding for 
the protein been isolated. 



SUMMARY OF THE INVENTION 

The present invention was completed by ac- 
complishing the cloning of a gene coding for the 
antitumor protein of S.ovooenes (the protein will 
hereinafter be referred to as SAGP) and determin- 
ing its total DNA sequence and amino acid se- 
quence, producing vectors containing SAGP gene 
sequence and being capable of expressing SAGP 
in host cells, cultivating host cells transformed with 
the vectors and confirming the production of SAGP 
by the host cell. The invention makes it possible to 
produce SAGP safely on a large scale without 
cultivating pathogenic microorganisms. 

The present invention provides an SAGP gene, 
DNA containing the gene, a recombinant plasmid 
containing SAGP gene and microorganisms or cells 
containing the plasmid, and a process for produc- 
ing SAGP by cultivating the microorganisms or 
cells transformed with the plasmids. SAGP so ob- 
tained can be used as an anti-tumor agent. 



BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows a restriction map of plasmid 
pSPI. Figure 2 is the base seguence of an inserted 

5 DNA fragment of pSP1 and the amino acid se- 
quence deduced from SAGP gene base sequence. 
The base number 40 -1219 region is the region 
coding for SAGP. ATG, an initiation codon coding 
for Met exists upstream of the amino acid se- 

10 quence coding for Thr of N terminal region of 
SAGP. The number of amino acids of SAGP is 
deduced to be 41 0. Figure 3 shows a brief chart for 
illustrating construction of expression vectors hav- 
ing inserted thereinto a purified DNA fragment of 

75 SAGP gene prepared by digesting pSP1 , plasmid 
containing SAGP gene, with EcoRI. 

DETAILED DESCRIPTION OF THE INVENTION 

20 

SAGP gene isolated from S.ovoaenes by the 
present invention has the DNA sequence shown in 
Fig. 2 and, by using the information on the DNA 
sequence of Fig. 2, SAGP gene can now easily be 
25 isolated from S.ovoQenes by procedures conven- 
tionally used in the field of the present invention. 

SAGP gene isolated from S.pvooenes Su is, so 
to speak, natural DNA (base) sequence. 

Today, the DNA synthesis technology makes it 
30 possible to synthesize DNA coding for a given 
protein, once the amino acid sequence of the pro- 
tein is elucidated. 

As is well know, most amino acids have more 
than one DNA base sequence or codon coding for 
35 them. Therefore, more than one DNA coding for a 
particular protein can be synthesized. 

Accordingly, in synthesizing SAGP gene cod- 
ing for the SAGP amino acid sequence in this 
invention, the DNA sequence is not limited to the 
40 natural SAGP gene DNA sequence, but contains all 
other DNA sequences coding for the SAGP amino 
acid sequence elucidated by the present inventors. 

The present gene recombination technique can 
induce artificial mutation on a specific region of 
45 DNA sequence without making any substantial 
change or with an improvement of the basic char- 
acteristic of what is encoded by the DNA se- 
quence. 

As to SAGP gene of this invention, it is possi- 
so ble to prepare SAGP genes having the same or 
better characteristic as or than that of the natural 
SAGP gene by artificial insertion, deletion or sub- 
stitution. 
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This invention encompasses such variant 
SAGP genes in addition to the natural gene. 

Expression vectors which enable the produc- 
tion of SAGP in host cells can be prepared by 
inserting SAGP gene into suitable expression vec- 
tors using procedures commonly used in the field 
of the present invention. 

Expression vectors used in this invention con- 
tain a SAGP gene sequence and a DNA sequence 
which controls the expression of SAGP gene in 
host cells upstream of the SAGP gene sequence, 
and are capable of replicating in the host cells. 

More specifically, the expression vector essen- 
tially has a promoter sequence upstream of the 
SAGP gene and the origin of replication, Le. DNA 
sequence controlling replication of the vector in a 
host cell. 

As a promotor, known promoters such as lac 
promotor, trp promotor or Ipp promotor can be 
used alone or in. combination when E.coli is used 
as a host cell. These examples are not limitative, 
however. 

In addition, the vector desirably contains a drug 
resistant gene to make it easy to select a host cell 
containing the vector. 

For example, an ampicilin resistant gene and a 
tetracyclin resistant gene can be used as the drug 
resistant gene when E.coli is used as a host cell. 

In the field of genetic engineering today, E.coli 
is most genetically analyzed and regarded as saf- 
est as host cells, and hence E.coli is most fre- 
quently used. 

For the replication of SAGP gene and produc- 
tion of SAGP, it is desirable to use Exoli , as host 
microorganism, because E.coli can multiply rapidly 
and it is easy to control expression of SAGP gene 
in E.coli . 

But it is possible to use some other kinds of 
bacteria, yeast, and so on, as a host ceil together 
with adequate expression vectors which can repli- 
cate in microorganisms to be used. 

Transformed E.coli is cultivated in a suitable 
medium and production of SAGP is confirmed -by 
the Ouchterlony method or the Western blotting 
method. 

SAGP may be isolated from the medium by 
methods conventionally used in the field of the 
invention. 

The individual steps mentioned above will be 
described in detail. 



1. Isolation of DNA fragment containing SAGP gene 
sequence: 

SAGP gene may be isolated from S.pvooenes 
5 by a conventional method when the information on 
the DNA or amino acid sequences of SAGP given 
by the invention is used. More specifically, it can 
be isolated for example, through the following pro- 
cedures: 



1) Synthesis of probes, short length DNA that is 
complementary to SAGP gene: 

75 Short length DNA's each of which has the base 

sequence complementary to the DNA sequence of 
SAGP gene described in Figure 2 are synthesized. 

It is possible to obtain adequate probes by 
determining the amino acid sequence of purified 

20 SAGP gene, deducing a DNA sequence from the 
amino acia 1 sequence and synthesizing short length 
DNAs which have the deduced DNA sequence. 

25 2) Extraction of S.pvooenes chromosomal DNA and 
digestion with restriction enzyme and fractionation. 

After cultivation of S.pyoQenes . the cells are 
lysed with an enzyme and/or a detergent, then the 
30 chromosomal DNA is extracted and purified from 
the lysed cells by a known method [J. Maromor J. 
Mol. BioL. 3, 280 (1961)]. The extracted 
chromosomal DNA is digested with a restriction 
enzyme. 

35 The digested chromosomal DNA is subjected 
to agarose gel electrophoresis, and then fractionat- 
ed according to the length of DNA fragments. 

The DNA fragments are transferred and immo- 
bilized on a nitrocellulose filter by the Southern's 

40 method [E M. Southern, J. Mol. Biol .. 98, 503 - 
(1975)]. By Southern hybridization with the RI- 
labeled DNA probes, the length of DNA fragments 
containing SAGP gene was determined. 

45 

3) Insertion of the DNA into a vector and trans- 
formation of host E.coli with the resulting recom- 
binant vector: 

so DNA fragments of such a size that contains 
SAGP gene are isolated from the digested 
chromosomal DNA fragments fractionated by 
agarose gel electrophoresis and then ligated with 
vector DNA previously digested with the same re- 

55 
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striction enzyme by DNA ligase. The ligation mix- 
ture is introduced into E.coli to give transformed 
cells. 

4) Selection of transformant carrying SAGP gene: 

The transformants obtained in 3) are grown on 
a nitrocellulose filter and lysed, and then the DNA 
fragment from lysed cells is immobilized on the 
filter. By hybridization of the immobilized DNA frag- 
ment described above with Rl-labeted probe, the 
transformed cells containing SAGP gene is iden- 
tified. 



5) Isolation of Plasmid DNA: 

The transformants determined to contain SAGP 
gene as in 4) are cultivated, and then plasmid DNA 
is isolated from them by a known method [T. 
Maniatis, E. F. Fritsch, J. Sambrook, Molecular 
Cloning, p.86, Cold Spring Harbor Laboratory - 
(1982)]. 

6) Preparation of a restriction enzyme map of plas- 
mid DNA: 

The plasmid DNA isolated in 5) is digested 
with restriction enzymes, the length of each frag- 
ment is determined by electrophoresis and a re- 
striction enzyme map of the plasmid DNA is pre- 
pared. 

7) Determination of DNA sequence of SAGP gene: 

Based upon the restriction enzyme map pre- 
pared in 6), SAGP gene DNA is digested into 
shorter length fragments with restriction enzymes 
and each fragment is cloned. Then, its base se- 
quence is determined by the Sanger method [G. F. 
Hong, Biosci. Report . 2, 907 (1982)]. 

In addition, the base sequences of the DNA 
fragments are also determined by the Maxim-Gil- 
bert method [A. Maxam and W. Gilbert, Method in 
Enzymology, 65,499 (1980)]. 

8) isolation of DNA fragment containing SAGP 
gene sequence from the plasmid vector carrying 
the gene: 

Plasmid vector containing SAGP gene se- 
quence is digested with a restriction enzyme. 



The DNA fragments of digested vector are 
fractionated by polyacrylamide gel electrophoresis, 
and a 2100 base pair DNA fragment containing 
SAGP gene is extracted and purified from the gel. 

5 

2. Construction of expression vector: 

SAGP gene is inserted into an expression vec- 

70 tor. 

For example, commercially available plasmid 
pKK 233-3 or pIN III is digested at a site down- 
stream of the promotor with restriction enzyme 
EcoRI, and mixed with DNA fragment containing 
75 SAGP gene described above and ligated with T4- 
DNA ligase. 

3. Transformation of host E.coli 

20 

The expression vector ligated with DNA con- 
taining SAGP gene is introduced into E.coli by a 
conventional transformation method. 

Plasmid DNA is prepared from the transfor- 
25 mants obtianed above and the transformant con- 
taining expression vector is selected by analyzing 
its restriction enzyme digestion pattern. 

30 4. Cultivation of transformed E.coli and induction of 
expression of SAGP gene: 

E.coli transformed by the expression vector 
described in 3 is inoculated and incubated in a 
35 liquid medium at 37° C. 

Growth of bacteria is observed by measuring 
the optical density of the culture medium and at 
the proper step of growth, expression of SAGP 
gene is induced by adding lsopropyl-0-D-thiogalac- 
40 toside (IPTG), for example. After that, the transfor- 
mants are further cultivated. 



5. Harvest of cells and isolation of SAGP 

45 

The liquid culture described in 4 is centrifuged 
to harvest the cells. 

The harvested E.coli cells are lysed with 
lysozyme, or a detergent, or by sonification, for 
so example and SAGP can be isolated from the mix- 
ture by a conventional method. 

SAGP is purified from all the lysate through 
analogous methods adopted for purification of 
SAGP from cultured cells of S.pvooenes Su; they 
55 are ammonium sulfate precipitation, Octyl- 
Sepharose CL-4B column chromatography, DE-52 
ion-exchange chromatography and TSK gel 
G3000SW gel filtration. 
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Production of SAGP is confirmed by analyzing 
the lysed mixture by the Ouchter(ony method or 
the western blotting method. 



Example 1 

1. Isolation of SAGP gene 

1) Determination of amino acid sequence of N- 
terminus of SAGP 

SAGP was purified by a known method - 
(Yoshimura t Japanese Patent Publication (Kokai) 
No. 58-222026) from S.ovooenes Su. (ATCC 
21060) and using purified SAGP 100 ug, the amino 
acid sequence of N terminus was analyzed by gas- 
phase protein sequencer (Applied Biosystems 
Model 470A). As a result the amino acid sequence 
of N-terminus of SAGP was determined as follows: 
Pro (or Thr)-AIa-Glu-Thr-Pro-lle-Unk-Val -Tyr-Unk- 
Unk-lle-Gly-Lys-Leu-Lys-Lys-Val-Leu -Leu-His-Unk- 
Pro-Gly-Lys 

In the formula, Unk means "unknown". 



2) Synthesis of Probe 

Based upon the amino acid sequence 'deter- 
mined above, 2 kinds of mixed probe P-37 and P- 
38, short length DNA fragments coding for a part of 
the amino acid sequence, were synthesized. 

The P-37 corresponds to the amino acid se- 
quence Pro-Ala-G I u-Thr- Pro-lie, and contains DNAs 
of the following sequences: 
CC(T,A)GC(T ) A)CAAAC(A,T)CC(T,A)AT(A,T)T. 

The P-38 corresponds to the amino acid se- 
quence lle-Gly-Lys-Leu-Lys-Lys-Vai, and contains 
DNAs of the following sequences: 
AT(T,A)GG(T,C)AAATT(A,G)AAAAAAGT. 

The P-37 is a mixture of 32 kinds of DNA 
sequences and P-38 is a mixture of 8 kinds of DNA 
sequences, as mentioned above. 

They were synthesized by automatic DNA syn- 
thesizer (Applied Biosystems Model 380A) to yield 
146 ug of the P-37 and 244 ug of the P-38. 

These probe DNA were ^P-labeled as follows 
just before the hybridization. 

Synthetic probe 5pM was incubated with T4- 
polynucleotide kinase and (7 32 p)-ATP30 Ci at 37 °C 
for 30 minutes. 

After the reaction product was adsorbed on a 
small amount of DE-52, and nonreacted (t^P^ATP 
was washed away with 0.1M-NaCI solution, 32 p~ 
labeled probe was extracted from the DE-52 with 
1 M-NaCI solution. 



3) Extraction and digestion with restriction enzyme 
of chromosomal DNA of S.pvooen $ Su 

S.pvoaenes Su (ATCC 21060) was cultivated in 
5 L culture medium 300 ml at 37°C for 15 hours, and 
then cells were harvested by centrifugation. 

The cells were suspended in 0.1 5M NaCI - 
0.1 M EDTA solution, and lysed with lysozyme. 
pronase and sodium dodecylsulfate (SDS). 
70 Chromosomal DNA was extracted from the 
lysate. and purified by a known method [J. Mar- 
mur, J. Mo[ . Biol .. 3, 208 (1961)] 

The chromosomal DNA 5 ug was incubated 
with restriction enzyme EcoRl at 37° C for 1 hour 
75 and the digested DNA was subjected to 1% 
agarose gel electrophoresis. 

This agarose gel was soaked in 0.2M NaOH - 
0.5M NaCI solution, so that DNA was denatured, 
and washed with tria-acetate-EDTA buffer for 10 
20 minutes 3 times. 

Then DNA on agarose gel was elec- 
trophoretically transferred to Zeta-Probe filter. - 
(Electrophoresis was done at 30V overnight). The 
filter was headed at 80°C for 3 hours after air- 
25 drying at room temperature for 30 minutes. 

Two Zeta-Probe filters on which DNA was im- 
mobilized by the above method were prehybridized 
with 4 x SSC [Blin et al.. Nucleic Acid Research . 3, 
2303 (1976)], 50x Den hart • [Wahl et al., 
30 Proc.Natl.Acad.Scie. . 76, 3683 (1979)], 1% SDS 
and sonicated salmon sperm DNA solution 10 
ug/ml at 60°C for 3 hours. 

Then, the filters described above were 
hybridized with the same solution containing 
35 100,000 cpm labeled P-37 probe or P-38 probe at 
40°C for 18 hours. 

The filters were air-dried after 4 times washing 
with 4 x SSC at 40°C 15-30 minutes, and then 
exposed to X-ray films for 48 hours. 
40 As a result, both hybridization with P-37 probe 
and with P-38 probe revealed that there exists a 
band hybridized on the position of about 2 kbp 
DNA fragment. 

45 

4) Insertion of chromosomal DNA fragment into 
vector and transformation of E.coli 

Chromosomal DNA 50 ug was digested by 
so incubating it with restriction enzyme EcoRl at 37° C 
for 1 hour and then applied to 1% agarose gel 
electrophoresis. The regions of the gel containing 
about 1 .8-2.2 kbp fragment was cut out, from which 
DNA fragment was extracted. 
55 The chromosomal DNA mentioned above was 

mixed with piasmid vector pUCl9 (Takara Shuzo)" 
digested with EcoRl and reacted with T4-DNA 
ligase at 17°C overnight. E.coli JM103 was trans- 
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formed with the reaction mixture [M. Mandel and A. 
Higa, J. Mol. Biol. , 53, 159 (1970)], inoculated on L- 
agar medium containing 5-bromo-4-chloro-3- 
indolyl-0-D-galactoside (X gel), isopropyi-jS-D- 
thiogalactoside (IPTG) and ampicillin and then cul- 5 
tivated at 37 °C overnight, from which sixty-four 
white colonies were picked up. 

5) Selection of transformant carrying SAGP gene io 

Each of the sixty-four transformed E.coli 
JM103 obtained above was inoculated in nitrocel- 
lulose filter on L-agar medium and cultivated at 
37°C for 6 hours. f5 

The filter was transferred to L-agar medium 
containing ampicillin, chloramphenicol and cultivat- 
ed at 37°C overnight. 

After cultivation, the filter was treated with 10% 
SDS for 5 minutes, with NaOH-1.5M NaCI for 3 20 
minutes, and with 1 R/l tris -HCI butter for 5 minutes 
so that bacteria on the filter were lysed. The DNA 
of them was immobilized on the filter by heating it 
at 80° C for 3 hours. 

Hybridization with the labeled P-38 probe de- 25 
scribed above and exposure to X-ray film revealed 
that two colonies among sixty-four colonies were 
hybridized with the probe. 

30 

6) Isolation of Plasmid DNA 

Two colonies revealed to have SAGP gene 
were cultivated and plasmid DNA was obtained by 
a known method [T. Maniatis, E. F. Fritsch, J. 35 
Sambrook, Molecular Cloning, p.86, Coid Spring 
Harbor Laboratory (1982)]. 

The plasmid DNA obtained from the two posi- 
tive clones were analyzed with several restriction 
enzymes. Consequently they represented the same 40 
restriction pattern, and were found to be identical to 
each other. They were named pSPl. 



7) Preparation of pSPl restriction map 45 

Piasmid pSP1 obtained above was digested 
with restriction enzymes, EcoRI, Hindi!!, Pstl, Clal, 
Ncol, Avail, Hpal, Hindi, Ahalli, Banl, Haeli, Mlul 
independently or in combination, and the digested so 
DNA fragments were applied to agarose gel or 
acrylamide gel electrophoresis to determine the 
length of fragments to prepare the restriction map 
ofpSPL 



Consequently, it was revealed that pSP1 was a 
plasmid wherein about 2 kbp foreign DNA fragment 
was inserted into the EcoRI site of pUC19. 



8) Determination of SAGP gene DNA sequence 

Based upon the restriction map described 
above, inserted DNA fragments of pSPl were fur- 
ther digested with restriction enzymes and cloned 
into pUC19, and using them, DNA sequencing was 
carried out by the dideoxy method [G. F. Hong, 
Biosci.Reoort . 12,907 (1982)]. Sequencing kit man- 
ufactured by Takara Shuzo. and M-4 or RV as a 
primer (Takara Shuzo) was used. The DNA se- 
quence of each fragment was determined to elu- 
cidate the entire 2l57bp sequence containing 
SAGP gene. 

9) Isolation of DNA fragment containing SAGP 
gene from the plasmid vector 

Plasmid pSP1 containing SAGP gene se- 
* quence obtained above 12.5 ag was digested with 
restriction enzyme EcoRI (Takara Shuzo) 50 units 
at 37° C for 2 hours. 

Thife digested mixture was fractionated with 5% 
polyacryiamide gel electrophoresis and the region 
of the gel containing DNA fragment of 21000 bp 
was cut out. 

DNA was extracted from the cut out gel with 
Tris-EDTA buffer. 

The extract was purified by washing with phe- 
nol and precipitation with ethanol to give 21000 bp 
DNA fragment containing SAGP gene. 



2. Construction of expression vector 

Two micrograms of pKK 223-2 (Pharmacia-PL), 
a plasmid DNA for construction of expression vec- 
tor containing trp and lac promotors (tac promotor) 
and ampicillin resistant gene was digested with 
restriction enzyme EcoRI 6 unit at 37 °C overnight 
which was then reacted with alkaline phosphatase 
at 37 6 C for 30 minutes to remove 5'terminal phos- 
phate residue. 

DNA was recovered from the mixture by wash- 
ing with phenol and precipitation with ethanol. 

pIN 111 Al plasmid DNA containing Ipp and lac 
promotor and ampicillin resistant gene [Inouye 
EMBO Journal, 1, 771 (1982)] was digested with 
EcoRI and 5'terminal phosphate was removed with 
alkaline phosphatase by the same method as 
above. The DNA fragment containing SAGP pre- 
pared in 1, 9 above was mixed with the vector 
DNAs described above, and the mixtures were 
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reacted with T4-DNA ligase (Takara Shuzo) at 4°C 
overnight to ligate them and produce expression 
vectors PIN (II SP and ptac SP, as illustrated in Fig. 
3. 



3. Transformation of host E.coli 

E.coli JM 103 was transformed by a known 
method [T. Maniatis, Molecular Cloning, p.250, 
Cold Spring Harbor Laboratory (1982)] with the 
expression vector described above. 

20 -30 clones for each transformant obtained 
above were cultured and their plasmid DNA was 
isolated by a known method (T. Maniatis, Molecular 
Cloning, p.86, Cold Spring Harbor Laboratory) and 
digested with various kinds of restriction enzymes. 
The digestion patterns were analyzed by agarose 
gel electrophoresis, so that clones of transformant 
E.coii JM103 (pIN til SP) (ATCC 67271), carrying 
pIN III SP and E.col? JM103 (ptac SP) (ATCC 
67272) carrying ptac SP were selected. 

4. Cultivation of transformed E.coli and induction of 
the expression of SAGP gene 

Each of the transformed E.coli cells carrying 
expression vector described above was inoculated 
in LB medium (T. Maniatis, Molecular Cloning, 
p.440, Cold Spring Harbor Laboratory) 100 ml t and 
incubated at 37° C. 

Growth of the culture was observed by measur- 
ing its optical density at 550 nm. When ODs» 
reached 0.2, IPTG(isopropy!-j9-D-thiogalactoside) 
0.1 mM (final concentration) was added to the 
medium and expression of SAGP was induced. 
The cultivation was continued. 

After the induction, a small amount of the me- 
dium was periodically taken out to examine produc- 
tion of SAGP. 

At the same time, Exoli harboring pKK 233-3 
or pIN HI Al was cultivated and IPTG was added. 
The medium was examined in the same way as 
controls. 



5. Confirmation of the production of SAGP 

Ceils in the sampled culture medium described 
above were harvested by centrifugation and the 
harvested cells were lysed by reacting with 
iysozyme 2 mg/ml at 0°C for 30 minutes, and 
adding 1% Triton-X100 (Wako Pure Chemicals Co., 
Ltd.). 



When the iysate was reacted with anti-SAGP 
rabbit antiserum by Ouchterlony method [O. Ouch- 
terlony, Prop . Allergy . 6, 30 (1962)] at 37° C over- 
night, an obvious sedimentation line was repre- 

5 sented by the antigen-antibody reaction if it was 
the lysed mixture of the cells containing pIN III SP 
or ptacSP, or SAGP isolated from S.pvooenes . In 
the case of mixtures containing pIN III AJ or pKK 
233-3, as control experiment no sedimentation line 

70 was observed. 

The lysed mixture was fractionated by 12.5% 
SDS-polyacrylamide gel electrophoresis, and the 
protein was transferred to a nitrocellulose filter by 
Western blotting method [W.N. Bumette, Anal . 

75 Biochem. . 112 , 195 (1981)] and immobilized. 

The filter was soaked in a buffer solution con- 
taining anti-SAGP rabbit antiserum at 37°C for 30 
minutes, and also at 40° C overnight Unreacted 
anti-SAGP antibody was washed away with the 

20 buffer, and the filter was soaked in a buffer solution 
containing alkaline phosphatase-anti-rabbit IgC anti- 
body at 37° C for 2 hours, ' whereby anti-SAGP 
antibody on the filter was reacted with it. 

When the filter was washed and soaked in the 

25 buffer containing BCIP (5-Bromo-4-chloro-3-indolyf- 
phosphate) for coloration, a band formed by the 
antigen-antibody reaction was observed in the 
same molecular weight (Ca. 47,000 daltons) region 
as SAGP isolated from S.pvoqenes in the cases of 

30 the transformant E.coli containing ptac SP or pIN III 
SP. 

Another E.coli containing pKK 233-3 or pIN 111 
Al, used as control experiment, did not represent 
such band. 

35 Accordingly, production of SAGP by E.coli con- 

taining expression vectors described above was 
confirmed. 



40 Example 2 

Purification and N-terminal amino acid se- 
quence of SAGP produced in E.coli JM103- 
(ptacSP) 

45 In order to confirm the amino acid sequence of 

the expressed protein in E.coli . the expressed pro- 
tein was purified and the N-terminal amino acid 
sequence was determined as follows. 

50 

1 ) Assay Method 

At each step of the purification process, the 
fractions containing SAGP were determined by an 
55 enzyme linked immunosorbent assay method. 



8 



13 



0 230 777 



14 



The samples and the standard SAGP were 
diluted by phosphate buffered saline (PBS), con- 
taining NaCI 8g, KCI 0.2g. NaHPO,.l2H r O 2.9g and 
KH 3 POi 0.2g per liter, and 100 ul of diluted solu- 
tions was put into each well of a 96-well flexible 
assay plate (Falcon® 3912) which had been 
washed by distilled water. After the plate was main- 
tained at 37°C for 2 hours, solutions in the wells 
were removed and 200 ul blocking solution (2% 
bovine serum albumin and 0.05% Tween® 20 in 
PBS) was poured into the each well and placed at 
37°C for 2 hours. The blocking solution was re- 
moved and then the assay plate was washed with 
PBST (0.05% Tween® 20 in PBS). Then 100 ul of 
the anti-SAGP rabit anti-serum which was diluted to 
10000-fold by PBST containing 1% BSA was put 
into each well and the plate was maintained at 
37° C for 2 hours. After the anti-serum solution was 
removed, the assay plate was washed with PBST. 
Then 100 ul of alkaline phosphatase linked anti- 
rabit-IGg anti-serum (Miles Scientific) which was 
diluted to 3000-fold with PBS was put into each 
well and the assay plate was maintained at 37° C 
for 2 hours. After the enzyme linked anti-serum 
solution was removed, the assay plate was washed 
with PBST. Then 100 ul of a substrate solution 
which contained 0.3% disodium p-nitrophenyl- 
phosphate in 10% diethanolamine buffer (pH 9.1) 
was put into each well and the assay plate was 
maintained at 37°C for 15 minutes. To stop' the 
enzyme reaction, 20 ul each of 1N NaOH was 
added. SAGP contained in sample solutions was 
identified and quantified by measuring the absor- 
bance of 450 nm light of the reaction mixtures. 



2) Cultivation of E.coli JM103(ptacSP) 

The cells of E.coli JM103(ptacSP) maintained 
on a LB-agar medium containing 50 ug/mi ampicil- 
lin were inoculated and cultivated in 5 ml LB me- 
dium containing 100 ug/ml ampicillin in test tubes 
for 14 hours at 37°C. Five mili-liters of the pre- 
culture broth was inoculated in 12 flasks of 2-liter 
volume each containing 500 ml LB-ampiciliin (100 
Ug/ml medium. Then the E.coli cells were cul- 
tivated at 37°C for 24 hours in a rotary shaker - 
(New Brunswick Scientific Inc. model G-25) at 280 
rpm. 

After the cultivation, the cells were harvested 
using a TOMY RS-20BH centrifuge equipped with a 
BH-9 rotor (10000 rpm, 10 min). 



The separated cells were resuspended in 200 
ml PBS and lysed by using a BRANSON 
SONIFIER. The ruptured cell "suspension was cen- 
trifuged (TOMY BH-9 rotor 10000 rpm, 10 min) and 
5 the pellet was removed. 



3) Ammonium sulfate precipitations 

10 The supernatant of the cell lysate was brought 
to 20% saturation with respect to ammonium sul- 
fate and was stirred for 2 hours at 4°C followed by 
standing for 2 hours. The solution was centrifuged - 
(Sorvall GSA rotor 10000 rpm, 15 min) and the 

75 pellet was removed. Then the supernatant was 
brought to 70% saturation with respect to ammo- 
nium sulfate and stirred for 2 hours at 4°C followed 
by standing overnight. The solution was centrifuged 
(Sorvall GSA rotor 10000 rpm, 15 min) and the 

20 pellet was recovered. 



4) Octyl-Sepharose CL-4B column chromatography 

25 The ammonium sulfate precipitate described 

above was dissolved in 6 ml of 10 mM potassium 
phosphate (pH 7.0) and applied to an Octyl- 
Sepharose CL-4B (Pharmacia) column (250 x 600 
mm), which had previously been equilibrated with 

30 10 mM potassium phosphate (pH 7.0). The 
chromatography was performed by 800 ml of 10 
mM potassium phosphate (pH 7.0) with ethylene 
glycol gradient (from 0 to 50%); flow rate 1 ml/min, 
8 ml/fraction. The fractions from No. 22 to No. 38 

35 were collected and dialysed against distilled water 
then lyophiiized. 

5) DE-52 ion-exchange chromatography 

40 

The SAGP fraction mentioned above was then 
dissolved in 5 ml of 10 mM potassium phosphate - 
(pH 7.0) and applied to an DE-52 (Whatman) col- 
umn (150 x 600 mm), which had been previously 

45 equilibrated with 10 mM potassium phosphate (pH 
7.0). Chromatography was performed by 800 ml of 
potassium phosphate (pH 7.0) with the salt con- 
centration gradient (from 10 to 500 mM); flow rate 
0.7 ml/min, 8 ml/fraction. The fractions from No. 48 

so to No. 58 were collected, dialysed against distilled 
water, and then lyophiiized. 
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6) DEAE-Sephadex ion-exchange chromatography 

The SAGP fraction mentioned above dissolved 
in 5 ml of 10 mM sodium phosphate (pH 7.0 and 
applied to an DEAE-Sephadex A-25 (Pharmacia) 
column (150 x 500 mm), which had been pre- 
viously equilibrated with 10 mM sodium phosphate 
(pH 7.0). Chromatography was performed by 300 
ml of 10 mM sodium phosphate with NaCI gradient 
(from 0 to 500 mM); flow rate 0.35 ml/min, 3 
ml/fraction. The fractions from No. 56 to No. 66 
were collected, dialysed against distilled water, and 
then lyophilized. 



7) TSK G-3000SW gel filtration 

The SAGP fraction mentioned above was dis- 
solved in 2 ml of PBS and 200 ul of the solution 
was applied to a TSK G-3000SW (Toyo Soda) gel 
filtration HPLC column (7.5 x 600 mm). Chromatog- 
raphy was performed by using PBS as an eluant; 
flow rate 0.3 ml/min. The eluate whose retention 
time was from 53.5 min to 56.8 min was recovered. 
This gel filtration was repeated once. 

The recovered solution was desalted by using 
a Sephadex PD-10 (Pharmacia) column, and then 
iyophilized. 



8) SDS-polyacryiamide gel electrophoresis 

The recovered eluates mentioned above was 
then applied to a 12.5% SDS-polyacryiamide gel 
electrophoresis and a gel piece which contained a 
protein whose mobility corresponded to a molecu- 
lar weight 4.7 kg was cut out 

The protein in the gel piece was elec- 
trophoretically eluted by a known method [M. W. 
Hunkapiller et al, Methods in Enzymology, 91 , 227 
(1983)]. 



9) N-terminal amino acid sequencing 

The protein recovered from the SDS- 
polyacryiamide gel mentioned above was applied 
to a Protein Sequencer model 470A (Applied 
Biosystems). 

The deduced N-terminal amino acid sequence 
was as follows. 

Thr-Aia-Gln-Unk-PrcHle-Unk-Val-Tyr 

The amino acid sequence was in agreement 
with the N-terminal portion of SAGP produced by 
S.pvooenes Su without the 4th and 7th amino 
acids; th y are threonine and histidine in the native 
SAGP and both of them are hardly detectable by 
the adopted sequencing method. Since methionine 



was also detected as the N-terminus together with 
threonine by the amino acid sequencing, it was not 
denied that the SAGP produced by Ecoli con- 
tained a protein whose N-terminal methionine was 
5 not processed. 

Thus it was confirmed that the cloned gene 
was correctly translated to the amino acid se- 
quence of SAGP. 

10 

Claims 

1. DNA coding for an antitumor protein pro- 
duced by Streptococcus pyogenes . 
75 2. DNA according to Claim 1, which is char- 

acterized by coding for the antitumor protein speci- 
fied by the amino acid sequence shown in Figure 
2. 

3. DNA according to Claim 2, which is char- 
20 acterized by having the base sequence shown in 

Figure 2 as coding for the antitumor protein. 

4. DNA containing a DNA sequence as claimed 
in Claim 1 , 2 or 3. 

5. DNA according to Claim 4, which is seif- 
25 replicating. 

6. DNA according to Claim 5, which is speci- 
fied as plasmid pSP1 . 

7. Expression vector which harbors DNA as 
claimed in any one of the preceding claims and 

30 which expresses the antitumor protein produced by 
S.pvooenes in host cells. 

8. Expression vector according to Claim 7, 
which is specified as ptac SP. 

9. Expression vector according to Claim 7, 
35 which is specified as pIN 111 SP. 

10. Expression vector according to Claim 7, 8 
or 9, which is characterized in that the host cell is 
E.coli . 

11. A microorganism containing DNA coding 
40 for an antitumor protein produced by Streptococ- 
cus pyogenes . 

12. Microorganism according to Claim 11, 
which belongs to E. coli: 

13. Microorgansim according to Claim 12, 
45 which is E.coli JM 103 (pSP1) (ATCC 67270). 

14. A method for producing an antitumor pro- 
tein which comprises cultivating transformant cells 
transformed with an expression vector as claimed 
in Claim 7, 8, 9 or 10. 

so 15. A method according to Claim 14, which 

comprises cultivating E.coli JM 103 (pIN ill SP) - 
(ATCC 67271) or E.coli JM 103 (ptac SP) (ATCC 
67272) . 
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Fig. 2- 1 

10 20 30 40 50 60 

GAATTCACAAAAACCAATTAGAAAGAGGTAATTCTTATGACTGCTCAAACACCAATTCAT 

MetThrAlaGlnThrProIleHis 

70 80 90 100 110 120 

GTTTATTCTGAAATTGGAAAATTAAAAAAAGTGTTGCTCCACAGACCCGGTAAAGAAATT 
ValTyrSerGluIleGlyLysLeuLysLysValLeuLeuHisArgProGlyLysGluIle 

130 140 150 160 170 180 

GAAAATCTTATGCCAGACTACTTGGAACGCTTGTTGTTTGATGACATTCCATTCTTAGAG 
GluAsnLeuMetProAspTyrLeuGluArgLeuLeuPheAspAspIleProPheLeuGlu 

190 200 210 220 230 240 

GATGCCCAAAAAGAACATGATGCTTTTGCCCAAGCTTTGCGTGACGAAGGTATTGAAGTC 
AspAlaGlhLysGluHisAspAlaPheAlaGlnAlaLeuArgAspGluGlylleGluVal 

250 260 270 280 290 300 

TTGTACCTTGAGACCTTAGCTGCAGAATC1CTTGTCACTCCAGAAATTCGCGAAGCTTTC 
LeuTyrLeuGluThrLeuAlaAlaGluSerLeuValThrProGluIleArgGluAlaPhe 

310 320 330 340 350 360 

ATCGATGAATACCTAAGTGAAGCTAATATCCGTGGAAGAGCTACTAAAAAAGCCATTCGT 
IleAspGluTyrLeuSerGluAlaAsnlleArgGlyArgAlaThrLysLysA'lalleArg 

370 380 390 400 410 420 

GAATTGTTGATGGCTATCGAAGACAATCAAGAATTGATTGAAAAAACCATGGCAGGAGTT 
GluLeuLeuMetAlalleGluAspAsnGlnGluLeuIleGluLysThrMetAlaGlyVal 

430 440 450 460 470 480 

CAAAAATCTGAACTTCCAGAGATCCCAGCATCTGAAAAAGGCTTGACTGACTTGGTTGAA 
GlnLysSerGluLeuProGluIleProAlaSerGluLysGlyLeuThrAspLeqValGlu 

490 500 51C 520 530 540 

TCCAATTACCCATTTGCCATCGACCCAATGCCAAACCTTTATTTCACACGGGACCCATTC 
SerAsnTyrProPheAlalleAspProMetProAsnLeuTyrPheThrArgAspProPhe 

550 560 570 580 590 600 

GCAACTATCGGTACAGGTGTTTCTCTTAACCACATGTTCTCAGAAACACGTAACCGTGAA 
AlaThrlleGlyThrGlyValSerLeuAscHisMetPheSerGluThrArgAsnArgGlu 

610 620 630 640 650 660 

ACCCTTTATGGTAAATACATTTTCACTCATCACCCAATTTATGGGGGTGGCAAAGTGCCT 
ThrLeuTyrGlyLysTyrllePheThrHisHisProIleTyrGlyGlyGlyLysValPro 

670 680 690 700 710 720 

ATGGTTTATGACCGTAATGAAACCACTCGCATTGAAGGTGGGGACGAACTTGTTCTTTCA 
MetValTyrAspArgAsnGluThrThrArglleGluGlyGlyAspGluLeuValLeuSer 
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Fig. 2-2 

730 740 750 760 770 780 

AAAGATGTGCTTGCGGTTGGTATTTCTCAACGTACAGATGCTGCTTCTATTGAAAAATTG 
LysAspValLeuAlaValGlylleSerGlnArgThrAspAlaAlaSerlleGluLysLeu 

790 800 810 820 830 840 

. TTGGTTAACATCTTTAAACAAAACCTTGGCTTCAAGAAAGTATTGGCCTTTGAATTTGCA 
LeuValAsnliePheLysGlnAsnLeuGlyPheLysLysValLeuAlaPheGluPheAla 

850 860 870 880 890 900 

AATAACCGTAAATTTATGCACTTAGACACTGTCTTTACCATGGTTGACTATGACAAATTT 
AsnAsnArgLysPheMetHisLeuAspThrValPheThrMetValAspTyrAspLysPhe 

910 920 930 940 950 960 

ACCATTCACCCAGAAATTGAAGGAGACCTTCGTGTTTACTCTGTCACTTACGACAATGAA 
ThrlleHisProGluIleGluGlyAspLeuArgValTyrSerValThrTyrAspAsnGlu 

970 980 990 1000 1010 1020 

GAACTTCATATCGTTGAAGAAAAAGGTGATTTAGCAGAACTTCTTGCTGCTAACCTTGGT 
GluLeuHisIleValGluGluLysGlyAspLeuAlaGluLeuLeuAlaAlaAsnLeuGly 

1030 1040 1050 1060 1070 1080 

GTTGAAAAAGTTGACCTTATCCGTTGTGGTGGTGACAACTTAGTAGCAGCAGGTCGTGAA 
ValGluLysValAspLeuIleArgCysGlyGlyAspAsnLeuValAlaAlaGlyArgGiu 

1090 1100 1110 1120 1130 1140 

CAATGGAACGATGGTTCTAACACCCTTACTATCGCACCAGGTGTGGTTGTGGTTTATAAC 
GlnTrpAsnAspGlySerAsnThrLeuThrlleAlaProGlyValValValValTyrAsn 

1150 1160 1170 1180 1190 1200 

CGTAACACCATTACCAATGCTATTCTTGAATCTAAAGGCTTGAAATTGATCAAGATTCAC 
ArgAsnThrlleThrAsnAlalleLeuGluSerLysGlyLeuLysLeuIleLysIleHis 

1210 1220 1230 1240 1250 1260 

GGAAGTGAATTGGTTCGCGGTCGTGGTGGACCTCGTTGTATGTCAATGCCATTTGAACGT 
GlySerGluLeuValArgGlyArgGlyGlyProArgCysMetSerMetProPheGluArg 

1270 1280 1290 1300 1310 1320 

GAAGATATTTAATAAGCTATGGTAAAGGTGGTTATAGGTCAGAAGCCTTTTTAAAGGGCA 
GluAspIle 

1330 1340 1350 1360 1370 1380 

GCTAGTGTTTATCTTCGCTTCTGTGTCTTTGTCCTTCAATGAAATTTTGTCATGACAGAT 

1390 1400 1410 1420 1430 1440 

AAATTTGATGCCAATGACGAAACAAGAACGGTTTATGCAGTCGTTTATGACAATGACCAG 

1450 1460 1470 1480 1490 1500 

CCCGTTTCAACAGGACAATTTTTAGCTGAAACGAAAATAGAAGCACGATTGACACGCATT 
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Fig. 2-3 
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GTAACCTTAGCAGATTATTGTGGTTGCGGTTATGGTGCCAAAGTCACTGAAGCGCTAGAA 

1570 1580 1590 1600 1610 1620 

ACTTATACCAGACGAGAAGGCTTTTACCAACTAACCATTCACAGTGAACTGACAGCACAA 

1630 1640 1650 1660 1670 1680 

ACCTTTTATGAAAACCTAGGTTATCAGACCTATGGTTCCAAGTATTTAGAAGATGGTGAG 

1690 1700 1710 1720 1730 1740 

TATTGTCAATCCCTTGTTAAAACCATTCTTAAATGGGAGAAGAATATGGACATAGCAATG 

1750 1760 1770 1780 1790 1800 

CTAATTGCGATTGTTGGTGGTCTATTAGGCTGCTATCTCTATCTCACAAAAAATAATGAA 

1810 1820 1830 1840 1850 1860 

CCCAAAGATTAAGTTAATACTCGAAGGAGACAATAGATGACACAAGTATTTCAAGGACGT 

1870 1880 1890 1900 1910 1920 

AGCTTCCTAGCAGAAAAAGATTTTACACGCGCTGAATTAGAATACCTTATTGATTTTTCA 

1930 . 1940 1950 1960 1970 1980 

GCTCATTTGAAAGATTTGAAAAAACGTGGTGTGCCTCATCACTACTTAGAAGGTAAAAAC 

1990 2000 * 2010 2020 2030 2040 

ATTGCCCTCTTGTTTGAAAAAACATCAACTCGTACGCGTGCAGCTTTTACAACAGCAGCC 

2050 2060 2070 2080 2090 2100 

ATTGACCTAGGTGCTCACCCAGAATACCTCGGTGCCAATGACATCCAACTTGGTAAAAAA 

2110 2120 2130 2140 2150 

GAATCAACAGAAGACACTGCTAAAGTATTGGGTCGTATGTTTGATGGGATTGAATTC 
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